Amendments to the Claims: 

This listing of claims will replace all prior versions, and listings, of claims in 
the application. 

Listing of Claims: 

1. (previously presented) A system supporting speech recognition comprising: 

two or more clients, each client comprising the capability to receive audio speech from a 
user, store the audio speech in one or more buffers in a raw uncompressed audio format, each 
buffer comprising a portion of the received audio speech, encode a buffer of the received audio 
speech before all of the audio speech is received, package the encoded buffer to receive audio 
speech into one or more packets to be transmitted over the internet before all of the audio speech 
is received, and transmit a packet of encoded audio speech over the internet before all of the 
audio speech is received; and 

a server, the server comprising the capability to receive packets of encoded audio speech 
from at least two clients, decode each of the packets of audio speech and store the resultant raw 
speech into one or more buffers for the respective client, and evaluate the resultant raw speech 
received from each of the at least two clients. 
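
By way of non-limiting illustration only, the streaming behavior recited in claim 1 might be sketched as follows. Nothing in this sketch is taken from the application: the names `client_stream`, `SpeechServer`, and `PACKET_SIZE` are hypothetical, `zlib` merely stands in for an unspecified speech codec, and `evaluate` is a placeholder for the recognition step. As a simplification, the server here reassembles the packets of one encoded buffer before decoding it.

```python
import zlib
from collections import defaultdict

PACKET_SIZE = 512  # hypothetical payload limit per packet, in bytes


def client_stream(client_id, audio_chunks):
    """Yield packets as each raw buffer arrives, before all audio is received.

    Each packet is (client_id, sequence_number, is_last_packet_of_buffer, payload).
    """
    seq = 0
    for raw_buffer in audio_chunks:          # one raw, uncompressed buffer at a time
        encoded = zlib.compress(raw_buffer)  # stand-in for a real speech encoder
        for start in range(0, len(encoded), PACKET_SIZE):
            payload = encoded[start:start + PACKET_SIZE]
            is_last = start + PACKET_SIZE >= len(encoded)
            yield (client_id, seq, is_last, payload)
            seq += 1


class SpeechServer:
    """Keeps separate decoded buffers for each client."""

    def __init__(self):
        self._partial = defaultdict(bytearray)  # encoded bytes awaiting decode
        self.raw_buffers = defaultdict(list)    # decoded raw speech per client

    def receive(self, packet):
        client_id, _seq, is_last, payload = packet
        self._partial[client_id] += payload
        if is_last:
            encoded = bytes(self._partial[client_id])
            self.raw_buffers[client_id].append(zlib.decompress(encoded))
            self._partial[client_id].clear()

    def evaluate(self, client_id):
        # Placeholder evaluation: reports how many raw bytes were received.
        return sum(len(buf) for buf in self.raw_buffers[client_id])
```

Because `client_stream` is a generator, the first packet can be sent while later buffers are still being recorded, and packets from several clients may arrive interleaved at the server without mixing their buffers.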

2. (previously presented) The system of claim 1, wherein the server further comprises the 
capability to transmit a response to a client, the response a result of the server's evaluation of the 
resultant raw speech received from the client, and 

a client of the two or more clients further comprises the capability to receive the response 
from the server. 

3. (previously presented) The system of claim 2, wherein the response is a text response, 
and a client of the two or more clients comprises a screen on which the client displays the text 
response. 

4. (canceled) 

5. (currently amended) The system of claim 1, wherein the server further comprises two 
or more stored text format files, and the server selects a stored text format file to transmit to a 
client of the two or more clients as a result of the server's evaluation of the resultant raw speech 
received from the client, and the server adjusts a processing time used to evaluate the resultant 
raw speech based on a value in a URL connection between the client and the server. 
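
By way of non-limiting illustration only, one way a server might derive a processing-time limit from a value carried in a URL is sketched below. The query parameter name `budget_ms`, the default, and the cap are all assumptions made for this example; the claim does not specify how the value is encoded.

```python
from urllib.parse import urlsplit, parse_qs

DEFAULT_BUDGET_MS = 500   # hypothetical default processing time
MAX_BUDGET_MS = 5000      # hypothetical upper bound enforced by the server


def processing_budget_ms(url):
    """Return the processing time, in milliseconds, requested in the URL.

    Falls back to the default when the (assumed) 'budget_ms' parameter is
    missing or not an integer, and clamps the result to [1, MAX_BUDGET_MS].
    """
    query = parse_qs(urlsplit(url).query)
    values = query.get("budget_ms")
    if not values:
        return DEFAULT_BUDGET_MS
    try:
        requested = int(values[0])
    except ValueError:
        return DEFAULT_BUDGET_MS
    return max(1, min(requested, MAX_BUDGET_MS))
```

For example, `processing_budget_ms("http://example.com/speech?budget_ms=1200")` yields 1200 under these assumptions.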

6. (previously presented) The system of claim 5, wherein the server further comprises the 
capability to partition a stored text format file into two or more packets for the transmission over 
the Internet, and to transmit each packet over the Internet to a client. 
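
By way of non-limiting illustration only, the partitioning of a stored text format file into packets recited in claim 6 might be sketched as follows. The function names, the `(index, total, payload)` packet layout, and the payload size are assumptions made for this example and do not appear in the application.

```python
def partition_text(text, max_payload=1024, encoding="utf-8"):
    """Split text into a list of (index, total, payload_bytes) packets."""
    data = text.encode(encoding)
    # Ceiling division; an empty file still produces one (empty) packet.
    total = max(1, -(-len(data) // max_payload))
    return [
        (i, total, data[i * max_payload:(i + 1) * max_payload])
        for i in range(total)
    ]


def reassemble(packets, encoding="utf-8"):
    """Rebuild the text on the client, tolerating out-of-order arrival."""
    ordered = sorted(packets, key=lambda packet: packet[0])
    # Bytes are joined before decoding, so a multi-byte character split
    # across two packets is still decoded correctly.
    return b"".join(payload for _, _, payload in ordered).decode(encoding)
```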

7. (canceled) 

8. (original) The system of claim 1 wherein the one or more buffers comprise a linked list 
of buffers. 
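
By way of non-limiting illustration only, a linked list of audio buffers of the kind recited in claim 8 might be sketched as follows. The class names `BufferNode` and `BufferList` are hypothetical; the application does not prescribe any particular implementation.

```python
from dataclasses import dataclass
from typing import Iterator, Optional


@dataclass
class BufferNode:
    data: bytes                          # one portion of the received audio
    next: Optional["BufferNode"] = None  # link to the next buffer, if any


class BufferList:
    """Singly linked list: append at the tail, consume from the head."""

    def __init__(self) -> None:
        self.head: Optional[BufferNode] = None
        self.tail: Optional[BufferNode] = None
        self._count = 0

    def append(self, data: bytes) -> None:
        node = BufferNode(data)
        if self.tail is None:
            self.head = node
        else:
            self.tail.next = node
        self.tail = node
        self._count += 1

    def pop_front(self) -> Optional[bytes]:
        """Remove and return the oldest buffer, or None if the list is empty."""
        if self.head is None:
            return None
        node = self.head
        self.head = node.next
        if self.head is None:
            self.tail = None
        self._count -= 1
        return node.data

    def __len__(self) -> int:
        return self._count

    def __iter__(self) -> Iterator[bytes]:
        node = self.head
        while node is not None:
            yield node.data
            node = node.next
```

A list of this kind lets the number of buffers grow with the length of the utterance, so recording can continue while earlier buffers are being encoded and sent.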

9. (currently amended) A system supporting speech recognition comprising: 

two or more clients, each client comprising the capability to receive audio speech from a 
user, store the audio speech in one or more buffers organized as a linked list in a raw 
uncompressed audio format, each buffer comprising a portion of the received audio speech, 
encode a buffer of the received audio speech before all of the audio speech is received, package 
the encoded buffer to receive audio speech into one or more packets to be transmitted over a 
network before all of the audio speech is received, and transmit a packet of encoded audio 
speech over the network before all of the audio speech is received; and 

a server, the server comprising the capability to receive packets of encoded audio speech 
from at least two clients, decode each of the packets of audio speech and store the resultant raw 
speech into one or more buffers for the respective client, and evaluate the resultant raw speech 
received from each of the at least two clients, wherein a level of processing used in the 
evaluation of the resultant raw speech received from each of the at least two clients is alterable 
based on a value communicated between the clients and the server. 

10. (original) The system of claim 9 wherein the encoded audio speech is in a compressed 
format. 

11. (previously presented) A system supporting speech recognition comprising: 

two or more clients, each client comprising the capability to receive audio speech from a 
user, store the audio speech in a first set of one or more buffers in a raw uncompressed audio 
format, each buffer comprising a portion of the received audio speech, write the stored audio 
speech from a first buffer in the first set of buffers to a second buffer in a second set of one or 
more buffers, encode the stored audio speech in the second buffer before all of the audio speech 
is received, package the encoded second buffer into one or more packets to be transmitted over 
the internet before all of the audio speech is received, and transmit a packet of encoded audio 
speech over the internet before all of the audio speech is received; and 

a server, the server comprising the capability to receive packets of encoded audio speech 
from at least two clients, decode each of the packets of audio speech and store the resultant raw 
speech into one or more buffers for the respective client, and evaluate the resultant raw speech 
received from each of the at least two clients. 

12. (currently amended) The system of claim 11, wherein the server further comprises the 
capability to transmit a response to a client, the response a result of the server's evaluation of the 
resultant raw speech received from the client, and the server alters a processing time used to 
evaluate the resultant raw speech based on a value communicated between the client and the 
server, and 

a client of the two or more clients further comprises the capability to receive the response 
from the server. 

13. (currently amended) The system of claim [[12]] ]J_, wherein the response is a text 
response, and a client of the two or more clients comprises a screen on which the client displays 
the text response wherein based on a URL sent by the client, the server determines whether to 
expect speech data for processing from the client. 

14. (previously presented) The system of claim 12, wherein the response is in text format, 
and a client of the two or more clients comprises a text-to-speech engine which converts a text 
format response to audio data, and an audio output device that the client uses to output the audio 
data to a user. 

15. (previously presented) The system of claim 11, wherein the server further comprises 
two or more stored text format files, and the server selects a stored text format file to transmit to 
a client of the two or more clients as a result of the server's evaluation of the resultant raw 
speech received from the client. 

16. (original) The system of claim 15, wherein the server further comprises the capability 
to partition a stored text format file into two or more packets for the transmission over the 
internet, and to transmit each packet over the internet to a client. 

17. (original) The system of claim 16, wherein a client further comprises an audio output 
device, and the capability to receive the packets of text format, convert the packets of text format 
to audio data and play the audio data to a user. 

18. (original) The system of claim 11 wherein the first set of one or more buffers 
comprises a linked list of buffers. 

19. (original) The system of claim 11 wherein the second set of one or more buffers 
comprises a linked list of buffers. 

20. (original) The system of claim 11 wherein the second set of one or more buffers 
comprises a predefined number of buffers. 

21. (original) The system of claim 11 wherein the encoded audio speech is in a 
compressed format. 

22. (previously presented) The system of claim 1 wherein a user selects a user objective 
at a client, the client transmits the user objective to the server, and the server evaluates the 
resultant raw speech received from the client based on the user objective. 

23. (previously presented) The system of claim 22 wherein the user objective comprises 
pronunciation accuracy. 

24. (previously presented) The system of claim 22 wherein the user objective comprises 
grammar. 

25. (currently amended) The system of claim 1 wherein before the client receives audio 
speech from a user, the server transmits a file to a client, and the client presents the file in an 
audio format to the user wherein a user selects a user objective at a client, the client transmits the 
user objective to the server, and the server evaluates the resultant raw speech received from the 
client based on the user objective and a value communicated to the server by URL. 

26. (currently amended) The system of claim 1 wherein before the client receives audio 
speech from a user, the server transmits a file to a client, the client presents the file in at least one 
of an audio or visual format to the user, and the server evaluates the resultant raw speech 
received from the client in connection with the file transmitted from the server to the client and a 
processing time used to evaluate the resultant raw speech will vary based on a value 
communicated to the server from the client. 

27. (currently amended) The system of claim 1 wherein the server transmits a first file to 
a client, the client presents the first file in at least one of an audio or visual format to the user, 
after presenting the first file to the user, the client receives audio speech from the user, and 

the server evaluates the resultant raw speech received from the client in connection with 
the first file transmitted from the server to the client and a processing time used by the server to 
evaluate the resultant raw speech is alterable based on a value communicated from the client to 
the server. 

28. (previously presented) The system of claim 27 wherein the server transmits a second 
file to the client, the client presents the second file in at least one of an audio or visual format to 
the user, after presenting the second file to the user, the client receives audio speech from the 
user, and 

the server evaluates the resultant raw speech received from the client in connection with 
the second file transmitted from the server to the client. 
